Multi-touch input discrimination

ABSTRACT

Techniques for identifying and discriminating between different types of contacts to a multi-touch touch-screen device are described. Illustrative contact types include fingertips, thumbs, palms and cheeks. By way of example, thumb contacts may be distinguished from fingertip contacts using a patch eccentricity parameter. In addition, by non-linearly deemphasizing pixels in a touch-surface image, a reliable means of distinguishing large objects (e.g., palms) from smaller objects (e.g., fingertips, thumbs and a stylus) is described.

CROSS-REFERENCES TO RELATED APPLICATIONS

This application is a divisional of U.S. application Ser. No. 14/668,859, filed Mar. 25, 2015 and published on Jul. 16, 2015 as U.S. Publication No. 2015-0199070, which is a divisional of U.S. application Ser. No. 14/444,848, filed Jul. 28, 2014, issued on May 5, 2015 as U.S. Pat. No. 9,024,906, which is a continuation of U.S. application Ser. No. 13/970,526, filed Aug. 19, 2013, issued on Jul. 29, 2014 as U.S. Pat. No. 8,791,921, which is a continuation of U.S. application Ser. No. 13/560,719, filed Jul. 27, 2012, issued on Sep. 10, 2013 as U.S. Pat. No. 8,531,425, which is a continuation of U.S. application Ser. No. 13/353,273, filed Jan. 18, 2012, issued on Aug. 14, 2012 as U.S. Pat. No. 8,243,041, which is a continuation of U.S. application Ser. No. 11/756,211, filed May 31, 2007, issued on Mar. 6, 2012 as U.S. Pat. No. 8,130,203, which is a continuation-in-part of U.S. application Ser. No. 11/619,464, filed Jan. 3, 2007, issued on Dec. 21, 2010 as U.S. Pat. No. 7,855,718, the contents of which are incorporated by reference herein in their entirety for all purposes.

BACKGROUND

The invention relates generally to data input methods and devices for electronic equipment and, more particularly, to methods and devices for discriminating between various inputs to a multi-touch touch-surface input device.

There currently exist many types of input devices for performing operations with an electronic system. These operations often correspond to moving a cursor and/or making selections on a display screen. Illustrative electronic systems include tablet, notebook, desktop and server computer systems, personal digital assistants, audio and video control systems, portable music and video players and mobile and satellite telephones. The use of touch pad and touch screen systems (collectively “touch-surfaces”) has become increasingly popular in these types of electronic systems because of their ease of use and versatility of operation.

One particular type of touch-surface is the touch screen. Touch screens typically include a touch panel, a controller and a software driver. The touch panel is characteristically an optically clear panel with a touch sensitive surface that is positioned in front of a display screen so that the touch sensitive surface is coextensive with a specified portion of the display screen's viewable area (most often, the entire display area). The touch panel registers touch events and sends signals indicative of these events to the controller. The controller processes these signals and sends the resulting data to the software driver. The software driver, in turn, translates the resulting data into events recognizable by the electronic system (e.g., finger movements and selections).

Unlike earlier input devices, touch-surfaces now becoming available are capable of simultaneously detecting multiple objects as they approach and/or contact the touch-surface, and detecting object shapes in much more detail. To take advantage of this capability, it is necessary to measure, identify and distinguish between the many kinds of objects that may approach or contact such touch-surfaces simultaneously. Prior art touch-surface systems (including their supporting software and/or circuitry) do not provide a robust ability to do this. Thus, it would be beneficial to provide methods and devices that identify and discriminate multiple simultaneous hover or touch events such as, for example, two or more closely grouped fingers, palm heels from one or more fingers, fingers from thumbs, and fingers from ears and cheeks.

SUMMARY

In one embodiment the invention provides a method to discriminate input sources to a touch-surface device. One method includes obtaining a proximity image, segmenting the image into a plurality of patches, determining an eccentricity value for each patch, and identifying those patches having an eccentricity value greater than a first threshold as thumb contacts. This method may also be used to identify fingertip contacts (i.e., those patches having an eccentricity value less than the first threshold).

Another method includes obtaining a proximity image, segmenting the image to identify a plurality of patches (each patch having one or more pixels and each pixel having a value), reducing the value of each pixel in a non-linear fashion, determining a minor radius value for each patch, identifying those patches having a minor radius value greater than a specified radius as palm contacts, and using the identified patches to control an operation of a touch-surface device.

In another embodiment, the first and second methods may be combined. Thresholds may comprise constant values, linear functions or non-linear functions. Illustrative operations of a touch-surface device include, but are not limited to, rejecting spurious input and changing the operating mode of the touch-surface device. Illustrative mode changes include, but are not limited to, dimming the device's backlight, putting the device to sleep, waking the device from a low-power state, putting the device into a low-power state (e.g., off or “sleep”) and, for mobile telephones, answering calls and terminating calls. One of ordinary skill in the art will recognize that the methods described herein may be organized as one or more program modules, stored in a tangible form (e.g., a magnetic disk), and executed by a programmable control device (e.g., a computer processor).

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows, in flowchart form, a multi-touch processing methodology in accordance with one embodiment of the invention.

FIG. 2 shows, in flowchart form, a patch irregularity calculation in accordance with one embodiment of the invention.

FIG. 3 shows an example plot of empirically determined data illustrating patch minor radii's ability to discriminate between large touch-surface contacts (cheeks, for example) and other touch-surface contacts (fingertips and thumbs, for example).

FIG. 4 shows an example plot of empirically determined data illustrating patch minor radii's ability to discriminate between palm contacts and other touch-surface contacts (e.g., fingertips and thumbs).

FIG. 5 shows an example plot of empirically determined data illustrating the use of patch eccentricity to distinguish thumb contacts from fingertip contacts.

FIG. 6 shows an example plot of empirically determined data illustrating the use of patch eccentricity to distinguish large contacts (e.g., cheeks and palms) from small contacts (e.g., thumbs and fingertips) and, further, to distinguish thumb contacts from fingertip contacts.

FIG. 7 shows a plot of empirically determined data illustrating a patch irregularity measure's ability to discriminate between ear contacts and other touch-surface contacts (e.g., fingertips, thumbs and cheeks).

FIG. 8 shows, in flowchart form, far-field operations in accordance with one embodiment of the invention.

FIG. 9 shows, in flowchart form, a patch parameterization operation using squashed pixel values in accordance with one embodiment of the invention.

FIG. 10 shows an example plot of empirically determined data illustrating the use of squashed pixel values to distinguish palm contacts from other types of contacts.

FIG. 11 shows, in block diagram form, a touch-surface device in accordance with one embodiment of the invention.

DETAILED DESCRIPTION

Methods and devices to detect and discriminate between multiple simultaneous close approaches or touches to a touch-surface are described. The following embodiments are presented to enable any person skilled in the art to make and use the invention as claimed and are provided in the context of mutual capacitance touch-surface devices. Variations using other types of touch-surfaces such as force or optical sensing touch-surfaces will be readily apparent to those skilled in the art. Accordingly, the claims appended hereto are not intended to be limited by the disclosed embodiments, but are to be accorded their widest scope consistent with the principles and features disclosed herein.

As previously noted, recent touch-surface input devices are capable of simultaneously detecting multiple objects as they approach and/or contact the touch-surface. For a hand-held multi-touch touch-surface device that may be put into a pocket, purse, or held against the head (e.g., portable music player, portable video player, personal digital assistant or mobile phone), detecting when the device is being clasped on the way into or out of the pocket, against the body, or against the head is very useful for: input rejection (ensuring that touch-surface input signals generated as a result of these actions are not mistaken for normal finger/stylus touches); operational mode transitions (e.g., dimming the device's backlight, putting the device to sleep and waking the device from a low-power state); and, for mobile telephones, answering calls (e.g., when the device is brought near, but not necessarily touching the head) and/or terminating calls (e.g., when the unit is placed into a pocket or purse).

Each sensing element (aka “pixel”) in a two dimensional array of sensing elements (i.e., a touch-surface) generates an output signal indicative of the electric field disturbance (for capacitance sensors), force (for pressure sensors) or optical coupling (for optical sensors) at the sensor element. The ensemble of pixel values represents a “proximity image.” As described herein, various embodiments of the invention address the ability to detect and discriminate between touch-surface signals (represented as a proximity image) resulting from, for example, the types of actions identified above.

Referring to FIG. 1, multi-touch processing methodology 100 in accordance with one embodiment of the invention begins with the acquisition of proximity image data (block 105). Because the acquired data is usually a superposition of information (indicating an object close to or in contact with the touch-surface), fixed offsets (due to circuitry baselines) and noise (e.g., radio frequency interference), an initial adjustment to acquired pixel data may be made to compensate for sensor element baseline activity. For example, on multi-touch device initialization and/or when being brought out of a low-power mode (e.g., sleep), one or more images may be captured. By assuming these initial images include no surface contacts, they may be used to provide the sensor's baseline. Averaging over multiple sequential images (using, for example, infinite or finite impulse response filters) has been found to provide more accurate baseline values. These baseline values may be subtracted from each subsequently captured image to provide a proximity image for use in ongoing image processing steps. In another embodiment, baseline pixel values may be slowly adjusted over time to compensate for temperature or static charge. In addition, the initial baseline values may need to be adjusted if, in fact, touch-surface contacts were present at start-up. In yet another embodiment, a plurality of image samples may be acquired each at a different sensor element driving frequency. For each pixel in these images, the mean or median of subtracted samples (i.e., between the captured baseline and information images) may be combined to create an initial (typically signed) image in accordance with block 105. For noise that occasionally generates large outlier pixel values (“spiky” noise), other rank-order filters may be useful. As noted in FIG. 1, proximity image data resulting from operations in accordance with block 105 is denoted [PROX].
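By way of illustration only, the baseline compensation described above might be sketched as follows. The function names, the nested-list image representation and the use of a plain average (a simple finite impulse response filter) are assumptions made for this example and are not taken from the disclosure.

```python
def estimate_baseline(frames):
    """Average several start-up frames, pixel by pixel (a simple FIR filter).

    Assumes the frames contain no surface contacts.
    """
    n = len(frames)
    rows, cols = len(frames[0]), len(frames[0][0])
    return [[sum(f[r][c] for f in frames) / n for c in range(cols)]
            for r in range(rows)]


def subtract_baseline(raw, baseline):
    """Return the signed proximity image [PROX] = raw - baseline."""
    return [[p - b for p, b in zip(raw_row, base_row)]
            for raw_row, base_row in zip(raw, baseline)]


# Hypothetical 2x2 sensor: two start-up frames, then one frame with a touch.
startup_frames = [[[10, 12], [11, 9]],
                  [[12, 10], [9, 11]]]
baseline = estimate_baseline(startup_frames)
prox = subtract_baseline([[31, 11], [10, 12]], baseline)
```

In this sketch the upper-left pixel stands well above its baseline while the remaining pixels stay near zero, which is the kind of signed image later blocks would consume.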

Next, [PROX] image data feeds other processing blocks that may operate sequentially or in parallel with one another (blocks 110, 115 and 120). It has been found that filtering or smoothing a proximity image (block 115) prior to segmentation (block 125) reduces the number of spurious peaks and thus helps reduce over segmentation. In one embodiment of block 115, each pixel value may be averaged with its nearest neighbor pixels in accordance with a discrete diffusion operation. If this approach is employed, it has been found beneficial to insert a “border” around the captured image so that there is a value with which to average the pixels at the edge of the captured image. For example, a one (1) pixel border may be added to the [PROX] image—where each “border” pixel is assigned a value corresponding to the image's “background” (e.g., zero). In another embodiment, both temporal (e.g., obtaining multiple images over a period of time) and spatial (e.g., averaging neighbor pixels) smoothing operations may be used. Multiple smoothing operations may be beneficial if the captured pixel data is particularly noisy. As noted in FIG. 1, image data resulting from operations in accordance with block 115 is denoted [SMTH].
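A minimal sketch of one possible spatial smoothing step is given below. It assumes an equal-weight average of each pixel with its four nearest neighbors and a one-pixel border filled with the background value; the weights and the function name `smooth` are illustrative assumptions, not values specified above.

```python
def smooth(prox, background=0.0):
    """Average each pixel with its 4 nearest neighbours to produce [SMTH].

    A one-pixel border of `background` values is added so that edge
    pixels have a value with which to be averaged.
    """
    rows, cols = len(prox), len(prox[0])
    padded = [[background] * (cols + 2)]
    padded += [[background] + list(row) + [background] for row in prox]
    padded += [[background] * (cols + 2)]

    smth = []
    for r in range(1, rows + 1):
        out_row = []
        for c in range(1, cols + 1):
            total = (padded[r][c]
                     + padded[r - 1][c] + padded[r + 1][c]
                     + padded[r][c - 1] + padded[r][c + 1])
            out_row.append(total / 5.0)  # equal weights: an assumption
        smth.append(out_row)
    return smth


# A single isolated spike is spread over its neighbours.
smth = smooth([[0, 0, 0],
               [0, 5, 0],
               [0, 0, 0]])
```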

While [PROX] image pixel values are typically zero or positive in response to an object contacting the touch-surface (aka, a “grounded” object), background noise or objects close to but not touching the touch-surface (aka “ungrounded” objects) may produce an image some of whose pixel values are negative. Background noise may be static or vary with circuit temperature, touch-surface moisture, or other factors. Noisy, negative pixels can cause excessive jitter in centroid and other patch measurements (see discussion below regarding block 135). To compensate for this, [PROX] image pixel values may be confined to a desired, typically positive, range (block 110). As noted in FIG. 1, image data resulting from operations in accordance with block 110 is denoted [CNST]. In one embodiment, all pixels whose values are less than a background noise threshold are set to zero. In another embodiment, a noise-threshold is subtracted from each pixel value and the result is forced to be non-negative, as shown in Table 1. Subtracting the noise threshold helps reduce centroid jitter induced from pixels that wander around (above and below) the noise threshold in successive image frames.

TABLE 1 Illustrative Pixel Constraint Technique

On a pixel-by-pixel basis:
    If [PROX] < (Noise Threshold)
        [CNST] = (Background Value)
    Else
        [CNST] = [PROX] − (Noise Threshold)

In one embodiment, the noise-threshold value is set to between 1 and 3 standard deviations of the noise measured at each pixel and the background-value is set to zero. One skilled in the art will recognize that other values are possible and that the precise choice of values depends, inter alia, on the type of sensor element used, the actual or expected level of pixel noise and the multi-touch device's operational environment. For example, the noise threshold may be set to a specified expected value on a per-pixel basis or a single value may be used for all pixels in an image. In addition, pixel noise values may be allowed to vary over time such that thermal and environmental effects on sensor element noise may be compensated for.
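The pixel constraint technique of Table 1 might be sketched as shown below. A single noise threshold for the whole image is assumed here for brevity (a per-pixel threshold is equally possible), and the function name `constrain` is hypothetical.

```python
def constrain(prox, noise_threshold, background=0.0):
    """Apply the Table 1 rule to every pixel of [PROX], producing [CNST]."""
    return [[background if p < noise_threshold else p - noise_threshold
             for p in row]
            for row in prox]


# Pixels below the threshold collapse to the background value;
# the rest are shifted down by the threshold.
cnst = constrain([[-2, 1, 5]], noise_threshold=2)
```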

Touch-surface contacts typically show up as grouped collections of “active” pixel values, where each region of fleshy contact (e.g. finger, palm, cheek, ear or thigh) is represented by a roughly elliptical patch of pixels.

By analyzing an image's topography, image segmentation operations can identify distinct pixel patches that correspond to touch-surface contacts (block 125). In one embodiment, bottom-up, ridge-hiking algorithms may be used to group pixels that are part of the same watershed around each peak pixel; each watershed group or pixel patch corresponds to a touch-surface contact. In another embodiment, top-down search algorithms may be used to identify pixel patches surrounding each peak pixel, starting from the peak, searching outward and stopping at valleys. As part of the image segmentation process, one-dimensional patches may be culled from the identified patches in that they generally result from isolated noise spikes or failure of an entire row or column of sensor elements and/or associated circuitry. In addition, because large contacts such as palms and elongated thumbs may produce multiple peaks in a proximity image (due to noise or non-uniform signal saturation, for example), multiple peaks in the image can grow into multiple, split patches. To account for this phenomenon, multiple detected patches may be merged to produce a reduced number of patches for further processing. Heuristic or empirically determined rules may, for example, be applied to accomplish this. For example, two separately identified patches may be merged when the saddle point along their shared border is not “very deep” (e.g., when the saddle magnitude is more than 60% to 80% of the two patches' peak pixel values). As noted in FIG. 1, identified patches resulting from operations in accordance with block 125 are denoted [P1, P2, . . . Pn].
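The saddle-depth merge heuristic may be illustrated with the short sketch below. It assumes the saddle value is compared against the larger of the two peak values, so that the test holds for both patches, and it uses 0.7 as a default fraction from within the 60% to 80% range mentioned above. Both choices, and the function name `should_merge`, are assumptions for illustration.

```python
def should_merge(saddle, peak_a, peak_b, fraction=0.7):
    """Return True when the saddle between two patches is not "very deep".

    `saddle` is the pixel value at the saddle point on the shared border;
    `peak_a` and `peak_b` are the two patches' peak pixel values.
    """
    return saddle > fraction * max(peak_a, peak_b)


# A shallow dip between two peaks of one palm: merge.
merge_palm = should_merge(saddle=80, peak_a=100, peak_b=95)
# A deep valley between two separate fingertips: keep apart.
merge_fingers = should_merge(saddle=20, peak_a=100, peak_b=90)
```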

Analysis shows that noise from pixels on the periphery of a patch, far from the center or peak pixel, can cause more jitter in calculated centroid (center-of-“mass”) measurements than the same amount of noise from central pixels. This phenomenon applies to other statistically-fitted patch parameters such as major/minor radii and orientation as well. This jitter can be a particularly serious problem for the smooth tracking of hovering objects because hovering objects do not generally induce strong central pixels, leaving the peripheral pixels with even greater influence on the centroid measurement. However, completely leaving these peripheral pixels out of a patch's centroid calculations would discard potentially useful information about the position, size, and shape of the patch. It is further noted that performing patch parameterization on diffused images may reduce noise from peripheral pixels, but standard spatial filtering processes also cause swelling and distortion of patch shape, cause adjacent patches to spread into one another and produce other effects that bias centroid and ellipse radii measurements in particular. Thus, a technique is needed that minimizes the amount of noise from patch periphery pixels without strongly distorting patch shape and ensuing measurements.

In accordance with one embodiment of the invention, therefore, patch peripheral pixel values may be selectively reduced, down-scaled or dampened (block 130). Generally, patch centroid determination may be improved by selectively down-scaling patch peripheral pixels that are fairly weak and whose neighbors are very weak. More specifically, in one embodiment calibrated image pixel values (e.g., in [CNST]) whose corresponding smoothed value (e.g., in [SMTH]) falls within a specified range defined by a lower-limit and an upper-limit are reduced in proportion to where the smoothed value falls within that range. Lower and upper limits are chosen empirically so that only those pixel values that are relatively weak (compared to patch peak values and background noise) are manipulated. It has been found that: if the lower-limit is set too low, the patch will “bloom” from background pixels that happen to have positive noise; if the lower-limit is set too high, the patch's centroid position will have a spatially periodic bias toward sensor element centers (e.g., capacitive electrode plate centers); if the upper-limit is not sufficiently higher than the lower-limit, periphery dampening will not provide any significant centroid jitter reduction benefits; and if the upper-limit is too high, all patch pixels besides the patch's peak pixel will be affected, again biasing determination of the patch's centroid toward sensor element centers. In accordance with one embodiment of the invention, the lower-limit is set, on a pixel-by-pixel basis, to approximately twice the background noise standard deviation and the upper-limit is set to approximately four times the background noise standard deviation (with the background value typically being zero). In another embodiment, the lower-limit is set to a value indicative of the “average” or “expected” noise across all pixels in the proximity image. In some embodiments, the noise value may change dynamically to reflect changing operational conditions (see comments above). As noted in FIG. 1, an image whose peripheral patch pixels have been dampened in accordance with block 130 is denoted [CNST′]. In one embodiment, peripheral patch pixels are dampened as shown in Table 2.

TABLE 2 Illustrative Peripheral Patch Pixel Dampening

For each pixel in a patch:
    If [SMTH] < (Lower Limit)
        [CNST′] = (Background Value)
    Else if [SMTH] > (Upper Limit)
        [CNST′] = [CNST]
    Else
        $\lbrack CNST' \rbrack = \frac{\lbrack SMTH \rbrack - \text{Lower Limit}}{\text{Upper Limit} - \text{Lower Limit}} \times \lbrack CNST \rbrack$
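The dampening rule of Table 2 might be sketched as follows. The sketch assumes [CNST] and [SMTH] are equally sized nested lists and that one pair of limits applies to the whole image; the function name `dampen` is hypothetical.

```python
def dampen(cnst, smth, lower, upper, background=0.0):
    """Apply the Table 2 rule pixel by pixel, producing [CNST']."""
    out = []
    for cnst_row, smth_row in zip(cnst, smth):
        row = []
        for c, s in zip(cnst_row, smth_row):
            if s < lower:
                row.append(background)        # very weak: suppress
            elif s > upper:
                row.append(c)                 # strong: leave unchanged
            else:
                # In between: scale by position within [lower, upper].
                row.append((s - lower) / (upper - lower) * c)
        out.append(row)
    return out


# Three pixels with the same [CNST] value but different smoothed values.
cnst_prime = dampen([[10, 10, 10]], [[1, 3, 5]], lower=2, upper=4)
```

With a lower limit of 2 and an upper limit of 4, the pixel whose smoothed value is 3 sits halfway through the range and therefore keeps half of its [CNST] value.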

Patch peripheral pixel dampening such as described above is equally applicable to touch-surfaces that provide one-dimensional proximity images. For example, projection scan touch-surfaces provide an output value (or signal) for each row and column of sensor elements in a touch-surface. In these types of touch-surfaces, a “patch” comprises a plurality of values, where each value represents a row or column measurement. The values at the ends of these patches (i.e., the peripheral values) may benefit from noise dampening as described here.

For certain touch-surface input devices such as a telephone, the ear and earlobe may contact the touch-surface sooner or more often than the cheek during calls. Unfortunately, earlobe patches can be very close in size to finger and thumb patches but should, nevertheless, not cause spurious finger-button activations during a call. In accordance with one embodiment of the invention, a measurement of patch irregularity is defined that does not look for any specific ear (patch) shape, but rather indicates a general roughness, non-roundness or folds in the pixel patch (block 120). That is, if a patch's irregularity measure is above a specified threshold, the contact is identified as an irregular object (e.g., not a cheek, finger or palm); otherwise the patch is identified as not an irregular object (e.g., a cheek, finger or palm).

Referring to FIG. 2, patch irregularity determination methodology 120 begins with the computation of a dispersion image (block 200). In general, the dispersion image (denoted [DISP] in FIG. 2) may be any high-pass filtered version of the initial proximity image [PROX]. In one embodiment, the [DISP] image is generated using a form of unsharp masking as follows:

$\lbrack DISP \rbrack = \lbrack PROX \rbrack - \lbrack SMTH \rbrack \qquad \text{EQ. 1}$

Next, the total energy for each patch [P1, P2, . . . Pn] is computed (block 205). In one embodiment, for example, a patch's total energy may be calculated by summing the square of each pixel value in the patch. This may be expressed mathematically as follows:

$\text{Total Energy in Patch } p = E_p = \sum_{i,j \in p} \left( DISP_{[i][j]} \right)^2 \qquad \text{EQ. 2}$

As noted in FIG. 2, total patch energy values resulting from operations in accordance with block 205 are denoted [E1, . . . En].

The total energy between adjacent pixels in a patch is then determined (block 210). To reduce the effect of energy spikes for pixel patches straddling an edge, the summations below should neglect (i.e., assume a value of zero) contributions from pixels whose neighboring pixels are at the image's border, see EQ. 3. For the same reason, the summations below should ignore contributions from pixels whose neighboring pixels are from a different patch.

$\text{Total Spatial Energy for Patch } p = SE_p = \frac{1}{4} \left( \sum_{i,j \in p} \left( DISP_{[i][j]} - DISP_{[i+1][j]} \right)^2 + \sum_{i,j \in p} \left( DISP_{[i][j]} - DISP_{[i-1][j]} \right)^2 + \sum_{i,j \in p} \left( DISP_{[i][j]} - DISP_{[i][j+1]} \right)^2 + \sum_{i,j \in p} \left( DISP_{[i][j]} - DISP_{[i][j-1]} \right)^2 \right) \qquad \text{EQ. 3}$

The sum is divided by 4 because each pixel gets counted once for each direction in the proximity image (left, right, up and down). As noted in FIG. 2, total patch spatial energy values resulting from operations in accordance with block 210 are denoted [SE1, . . . SEn]. Next, the energy associated with each patch's peak pixel is determined (block 215) as follows:

$\text{Peak Energy for Patch } p = PE_p = \max_{i,j \in p} \left( DISP_{[i][j]} \right)^2 \qquad \text{EQ. 4}$

As noted in FIG. 2, peak patch energy values resulting from operations in accordance with block 215 are denoted [PE1, . . . PEn].

Finally, an irregularity measure for each patch is calculated (block 220). In one embodiment, the irregularity measure is defined as the ratio of a patch's spatial energy minus its peak energy to the patch's total energy:

$\text{Irregularity Measure for Patch } p = IM_p = \frac{SE_p - PE_p}{E_p} \qquad \text{EQ. 5}$

In another embodiment, the irregularity measure may be based on the proximity image as a whole. That is, the entirety of the dispersion image (i.e., all pixels) may be treated as a single “patch” for purposes of generating an irregularity measure value. One benefit to this approach is that abnormal touch-surface conditions may be detected, and responded to, prior to segmentation operations in accordance with block 125 (see FIG. 1). Illustrative abnormal touch-surface conditions include, but are not limited to, liquid (e.g., water or sweat) on the touch-surface or multiple irregular objects in close proximity to or in contact with the touch-surface (e.g., coins and/or keys). When these conditions are detected, it may be beneficial to acquire new sensor element baseline values. In addition, if multiple touch-surface sensor sampling frequencies are employed, an irregularity measure may be computed at each of the frequencies. If one or more of the computed irregularity measure values is greater than a specified threshold as discussed above, the sampling frequencies associated with the above-threshold values may be deemed to be affected by an excessive amount of noise (e.g., radio frequency noise) and ignored. Periodic determination of frequency-dependent irregularity measures in this manner may be used to detect when such noise sources occur and when they disappear, for example, due to changes in a touch-surface device's operating environment.
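As a rough sketch under stated assumptions, the irregularity computation of EQs. 1 through 5 might be coded as follows. A patch is represented here as a set of (i, j) pixel coordinates, and a neighbor difference is skipped whenever the neighbor lies outside the patch (and therefore also whenever it lies outside the image). That is one reading of the border rule above; the names `dispersion` and `irregularity` are hypothetical.

```python
def dispersion(prox, smth):
    """EQ. 1: [DISP] = [PROX] - [SMTH] (unsharp masking)."""
    return [[p - s for p, s in zip(p_row, s_row)]
            for p_row, s_row in zip(prox, smth)]


def irregularity(disp, patch):
    """EQs. 2-5 for one patch, given as a set of (i, j) coordinates."""
    total = sum(disp[i][j] ** 2 for i, j in patch)          # EQ. 2
    peak = max(disp[i][j] ** 2 for i, j in patch)           # EQ. 4

    spatial = 0.0                                           # EQ. 3
    for i, j in patch:
        for di, dj in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            ni, nj = i + di, j + dj
            if (ni, nj) in patch:  # skip other patches and off-image pixels
                spatial += (disp[i][j] - disp[ni][nj]) ** 2
    spatial /= 4.0

    return (spatial - peak) / total if total else 0.0       # EQ. 5


whole = {(0, 0), (0, 1), (1, 0), (1, 1)}
rough = irregularity([[1, -1], [-1, 1]], whole)   # checkerboard pattern
flat = irregularity([[1, 1], [1, 1]], whole)      # uniform pattern
```

The checkerboard input scores much higher than the uniform one, which matches the intent that folded or rough contacts such as ears produce larger irregularity values than smooth contacts. Passing every pixel coordinate as the patch, as done here, corresponds to the whole-image variant.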

In general, an oddly shaped collection of pixels (i.e., a patch) can require a relatively large set of numbers to define its boundary and signal value at each pixel within the patch. To reduce the computational complexity of identifying, distinguishing and tracking touch events, however, it is advantageous to characterize patches identified in accordance with block 125 with as few numbers as practical. Because most patches from flesh contact tend to have an elliptical shape, one approach to patch parameterization is to fit an ellipse to each patch. One benefit of this approach is that an ellipse is completely described by a relatively small collection of numbers—its center coordinates, major and minor axis lengths, and major axis orientation.

Referring again to FIG. 1, using this approach, known centroid or center-of-mass computations may be used to parameterize each patch (block 135). In general, a patch's centroid may be determined using these techniques and the [CNST′] image (see block 130). In addition, the [CNST′] image may be used to generate patch covariance matrices whose eigenvalues identify a patch's major and minor radii and whose eigenvectors identify the patch's orientation. For contact discrimination operations (see discussion below regarding block 140), the following patch characteristics are also computed:
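One way such an ellipse fit could look is sketched below. It assumes pixel values act as weights, that the patch has a positive total signal, and that each radius is taken as the square root of the corresponding eigenvalue of the 2x2 weighted covariance matrix; the scale factor between eigenvalue and radius is an assumption, as is the function name `fit_ellipse`.

```python
import math


def fit_ellipse(image, patch):
    """Return (centroid, major_radius, minor_radius, orientation).

    `image` is e.g. [CNST']; `patch` is a collection of (i, j) coordinates.
    Orientation is the angle of the major axis measured from the i axis.
    """
    total = sum(image[i][j] for i, j in patch)
    ci = sum(i * image[i][j] for i, j in patch) / total
    cj = sum(j * image[i][j] for i, j in patch) / total

    # Weighted covariance matrix [[sii, sij], [sij, sjj]].
    sii = sum(image[i][j] * (i - ci) ** 2 for i, j in patch) / total
    sjj = sum(image[i][j] * (j - cj) ** 2 for i, j in patch) / total
    sij = sum(image[i][j] * (i - ci) * (j - cj) for i, j in patch) / total

    # Closed-form eigenvalues of a symmetric 2x2 matrix.
    mean = (sii + sjj) / 2.0
    spread = math.hypot((sii - sjj) / 2.0, sij)
    major = math.sqrt(mean + spread)
    minor = math.sqrt(max(mean - spread, 0.0))
    orientation = 0.5 * math.atan2(2.0 * sij, sii - sjj)
    return (ci, cj), major, minor, orientation


# A uniform one-pixel-high strip: centroid in the middle, elongated along j.
centroid, major, minor, angle = fit_ellipse([[1.0, 1.0, 1.0]],
                                            [(0, 0), (0, 1), (0, 2)])
```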

$\text{Total Signal for Patch } p = \sum_{i,j \in p} CNST_{[i][j]} \qquad \text{EQ. 6}$

$\text{Signal Density for Patch } p = \frac{\text{Total Signal for Patch } p}{\text{Geometric Mean Radius of Patch } p} \qquad \text{EQ. 7}$

In another embodiment, patch signal density may be approximated by:

$\text{Signal Density for Patch } p = \frac{\text{Total Signal for Patch } p}{\text{Number of Pixels in Patch } p} \qquad \text{EQ. 8}$

Patch eccentricity is given by:

$\text{Patch Eccentricity} = \frac{\text{Patch Major Axis}}{\text{Patch Minor Axis}} \qquad \text{EQ. 9}$
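The patch characteristics of EQs. 6 through 9 might be computed as in the sketch below. It assumes the geometric mean radius is the square root of the product of the major and minor radii, and that the major and minor axis lengths in EQ. 9 may be represented by the corresponding radii; all function names are hypothetical.

```python
import math


def total_signal(cnst, patch):
    """EQ. 6: sum of [CNST] pixel values over the patch."""
    return sum(cnst[i][j] for i, j in patch)


def signal_density(total, major_radius, minor_radius):
    """EQ. 7: total signal over the geometric mean radius (assumed form)."""
    return total / math.sqrt(major_radius * minor_radius)


def signal_density_approx(total, pixel_count):
    """EQ. 8: total signal over the number of pixels in the patch."""
    return total / pixel_count


def eccentricity(major, minor):
    """EQ. 9: ratio of major to minor axis."""
    return major / minor


patch = [(0, 0), (0, 1), (1, 0), (1, 1)]
total = total_signal([[2, 4], [4, 2]], patch)
density = signal_density(total, major_radius=4.0, minor_radius=1.0)
density_approx = signal_density_approx(total, len(patch))
ecc = eccentricity(4.0, 1.0)
```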

Prior art techniques to discriminate objects that actually contact a touch-surface from those that are merely hovering above it have relied upon a patch's total signal parameter (see, for example, EQ. 6). This approach, however, is very dependent upon the size of the object being identified. That is, prior art techniques that threshold on a patch's total signal value generally only work well for objects of a single size. For example, a total patch signal threshold selected to identify a fingertip contact could trigger detection of a thumb or palm when those objects are far above the touch-surface. Such a situation can lead to the mis-activation of keys, buttons or other control elements, the activation of control elements prior to surface contact and the mis-identification of patches (e.g., identifying a patch actually caused by a palm as a thumb).

In contrast, a discrimination technique in accordance with one embodiment of the invention uses a patch's signal density parameter (see, for example, EQs. 7 and 8). It has been found that this approach provides a robust means to distinguish objects that contact the touch-surface from those that are held or hovering above the surface, regardless of the object's size. For instance, the same density threshold can discriminate surface contact for fingers (adult and children), thumbs, palms and cheeks.

If the patch signal density parameter is normalized such that a firm fingertip contacting the touch-surface produces a peak value of 1, then a lightly brushing contact typically produces values slightly greater than 0.5 (e.g., half the normalized value) while a hovering object would produce a patch density value generally less than 0.5. It will be recognized that what constitutes “slightly greater” or “slightly less” is dependent upon factors such as the type of sensor elements used and their physical layout. Accordingly, while the precise determination of a threshold value based on patch signal density will require some experimentation, it would be well within the purview of an artisan of ordinary skill with benefit of this disclosure.

It has also been determined that fingernail touches produce patch signal density values generally less than approximately 0.5. This is because the nonconductive fingernail holds the conductive finger flesh more than approximately 1 millimeter above the touch-surface. Accordingly, a threshold operation based on patch signal density is also a reliable means for discriminating between fleshy fingertip touches and back-of-fingernail touches.
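The density computations of EQs. 6 through 8 and the contact-versus-hover test described above can be sketched as follows. This is a minimal illustration, not the patented implementation: the function names, the flat list of pixel values, and the assumption that the density has already been normalized so that a firm fingertip yields 1 are all hypothetical. The 0.5 threshold is the nominal value mentioned in the text and would need tuning for a given sensor.

```python
import math


def total_signal(patch_pixels):
    """EQ. 6: sum of the pixel values belonging to one patch."""
    return sum(patch_pixels)


def density_by_radius(patch_pixels, major_radius, minor_radius):
    """EQ. 7: total signal divided by the geometric mean radius."""
    geometric_mean_radius = math.sqrt(major_radius * minor_radius)
    return total_signal(patch_pixels) / geometric_mean_radius


def density_by_pixel_count(patch_pixels):
    """EQ. 8: total signal divided by the number of pixels in the patch."""
    return total_signal(patch_pixels) / len(patch_pixels)


def is_touching(normalized_density, threshold=0.5):
    """True for a surface contact, False for a hover or fingernail touch.

    Assumes the density was normalized so a firm fingertip gives 1.0;
    the threshold value is illustrative and sensor dependent.
    """
    return normalized_density > threshold
```

Because the test uses density rather than total signal, the same threshold applies whether the patch comes from a child's fingertip or a palm.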

With patch parameterization complete, the various types of touch-surface contacts may be distinguished (block 140). Using the parameters identified above, it is possible to robustly and reliably distinguish large objects (e.g., cheeks and palms) from other objects (e.g., fingers and thumbs) and irregular objects (e.g., ears) from regular objects (e.g., fingers, thumbs, cheeks and palms), and to identify finger-clasp actions (e.g., when a user clasps a multi-touch touch-surface device to put it into or withdraw it from a pocket). Identification of and discrimination between these types of touch-surface inputs permits an associated device to be controlled in a more robust manner. For example, in one embodiment detection of a large object may be used to transition the device from one operational state (e.g., off) to another (e.g., on). In another embodiment, input identified as the result of a large or irregular object, which might normally cause a state transition, may be safely ignored if in one or more specified states. For example, if a touch-surface telephone is already in an “on” or “active” state, identification of a large or irregular object may be ignored.

As previously noted, it can be advantageous to distinguish large objects (e.g., cheeks and palms) from small objects (e.g., fingertips), regardless of whether the objects are hovering a few millimeters above the touch-surface or are pressed firmly against the surface. It has been found that a contact's minor radius measure provides a robust discriminative measure to accomplish this. If a patch's minor radius exceeds a specified threshold, the contact can reliably be classified as a cheek, as opposed to a finger or thumb, for example. This same measurement can also detect a nearby leg (e.g., thigh) through a few millimeters of fabric (e.g., when a device is inserted in the pocket with its touch-surface facing the body). This measurement has been found to be so robust that if other patches appear on the surface with smaller minor radii (e.g., from an earlobe), they may be safely ignored. Referring to FIG. 3, illustrative empirical data is shown that illustrates the distinction between cheek contacts 300 and other contacts 305 (e.g., fingertips and thumbs) based on patch minor radii. While the exact values for patch contacts may vary from sensor to sensor and population to population, it is clear from FIG. 3 that threshold 310 may be set anywhere between approximately 11 millimeters and approximately 15 millimeters. (In this and the following data plots, patch signal density values are normalized to 1 for a fully contacting fingertip.) While threshold 310 is described by a constant value (i.e., dependent only upon patch minor radius), this is not necessary. For example, threshold 310 may be described by a linear or non-linear relationship between multiple parameters such as patch minor-radius and patch signal density (see discussion below regarding FIG. 4).

While similar size testing may be performed using a patch's major or geometric mean radius (i.e., $\sqrt{(\text{patch major axis radius})(\text{patch minor axis radius})}$), the minor-radius discrimination described here has been found to be superior because it is better able to separate thumbs or flattened fingers from large contacts. (Flattened fingers may produce major radii as large as a cheek major radius, but their minor radii are typically no larger than a normal fingertip touch.)

It will be recognized that distinguishing a palm contact from fingertip or thumb contacts can be especially difficult because the patch radii resulting from a palm contact for people with small hands may approach the patch radii caused by thumb or fingertip contacts for people with large hands. These types of contacts may also be distinguished in accordance with the invention using the patch minor radius parameter. Referring to FIG. 4, illustrative empirical data is shown that illustrates the distinction between palm contacts 400 and other contacts 405 (e.g., fingertips and thumbs) based on patch minor radii. It has been found that patch signal density values tend to be low for hovering contacts of any size, and saturate at a level independent of object size as the object presses firmly onto the touch-surface. Thus, the palm versus other object decision threshold 410 may be reduced for contacts with lower signal density because hovering or lightly touching fingers produce lower minor radii than firmly touching fingers, whereas palms tend to produce large minor radii even when hovering. Accordingly, decision threshold 410 may be represented by a straight line with a small positive slope. While the exact values for patch contacts will vary as noted above, it is clear from FIG. 4 that threshold 410 may be made to distinguish palm contacts from other contacts. Using this approach, there is virtually no risk that a hovering palm (a contact that typically produces a patch signal density value similar to that of a touching finger) will mistakenly be interpreted as a cursor move or button activation (e.g., a “click” event).
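A density-dependent palm threshold of the kind attributed to threshold 410 can be sketched as a line whose minor-radius cutoff rises slightly with signal density. The intercept and slope below are hypothetical placeholders, not values read from FIG. 4, and the function names are illustrative only.

```python
def palm_threshold_mm(density, base_mm=9.0, slope_mm=2.0):
    """Minor-radius cutoff as a line with a small positive slope.

    base_mm and slope_mm are assumed values for illustration; real
    values would be fitted to measured data for a particular sensor.
    """
    return base_mm + slope_mm * density


def is_palm(minor_radius_mm, density, base_mm=9.0, slope_mm=2.0):
    """Classify a patch as a palm when its minor radius exceeds the
    density-dependent cutoff."""
    return minor_radius_mm > palm_threshold_mm(density, base_mm, slope_mm)
```

With these placeholder numbers a 9.5 mm patch is treated as a palm when hovering (density 0.1) but not when pressed firmly (density 1.0), which mirrors the reasoning that fingers only reach large minor radii under firm contact.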

Referring to FIG. 5, one technique in accordance with the invention uses patch eccentricity (see EQ. 9) to distinguish fingertip contacts from thumb contacts (shown as “other contacts” in FIGS. 3 and 4). As shown, in one embodiment thumb contacts 500 may be easily distinguished from fingertip contacts 505 via constant threshold 510. In another embodiment, thumb and fingertip contacts may be distinguished based on a linear or non-linear function. It has been found that fingertips typically have eccentricity values between 1.0 and 1.5 while thumbs have eccentricity values greater than 1.5. The use of eccentricity in this manner has unexpectedly been found to be more reliable in correctly identifying thumbs than prior art techniques (which typically use total signal and separation angles between other fingers).

It will be noted that FIG. 5 includes only thumb and fingertip contacts. This implies that larger contacts such as cheeks (see FIG. 3) and palms (see FIG. 4) have already been identified and, therefore, removed from consideration. In accordance with another embodiment of the invention, minor radius may be used to first discriminate large contacts 600 (e.g., cheeks 300 and palms 400) from thumb contacts 500 and fingertip contacts 505 with first threshold 605. In still another embodiment, the threshold separating thumbs and fingertip contacts from large contacts may be linear or non-linear. Thumb contacts 500 may then be distinguished from fingertip contacts 505 via threshold 510 in accordance with FIG. 5. While the exact value of thresholds 510 and 605 will be implementation dependent, they will nevertheless be straight-forward to determine by one of ordinary skill in the art having the benefit of this disclosure.
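The two-stage decision described above (large contacts first, then thumb versus fingertip) might be sketched as follows. The 1.5 eccentricity cutoff comes from the text; the 13 mm minor-radius cutoff is an assumed value chosen from inside the 11 to 15 millimeter range quoted for cheeks, and the function name and string labels are hypothetical.

```python
def classify_contact(minor_radius_mm, eccentricity,
                     large_threshold_mm=13.0, thumb_eccentricity=1.5):
    """Two-stage classification using constant thresholds.

    Stage 1 (in the role of threshold 605): a large minor radius marks
    a cheek or palm. Stage 2 (in the role of threshold 510): among the
    remaining contacts, high eccentricity (EQ. 9) marks a thumb.
    Both cutoffs are illustrative and implementation dependent.
    """
    if minor_radius_mm > large_threshold_mm:
        return "large"
    if eccentricity > thumb_eccentricity:
        return "thumb"
    return "fingertip"
```

Ordering matters here: eccentricity is only consulted after large contacts are removed, matching the observation that FIG. 5 contains only thumbs and fingertips.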

Being able to distinguish thumbs from other fingertips allows a larger number of input patterns (e.g., finger/thumb motions across a device's multi-touch touch-surface) to be uniquely recognized. For example, the motion of a fingertip across a touch-surface in a first pattern (e.g., a circle) may generate an action in accordance with a first command, while that same gesture using a thumb could result in a different action. Being able to distinguish thumbs from other fingertips also permits one to identify a cluster of contacts as left-handed or right-handed. That is, if the identified thumb contact is left of cluster center, this indicates a contact from the fingers of a right hand. Similarly, if the identified thumb contact is right of cluster center, this indicates a contact from the fingers of a left hand. This knowledge may, in turn, be used to enlarge the number of distinct commands a user may generate from hand input. For example, a given contact pattern using the right hand may effect a first action (e.g., open a file), while the same pattern using the left hand may effect a second action (e.g., close a file).
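The handedness rule can be illustrated with a short sketch: a thumb left of the cluster center suggests a right hand, and a thumb right of the center suggests a left hand. The function name, the use of centroid x-coordinates, and the convention that x increases toward the user's right are assumptions made for this example.

```python
def hand_of_cluster(thumb_x, finger_xs):
    """Infer which hand produced a cluster of contacts.

    thumb_x is the x-coordinate of the contact already identified as
    the thumb; finger_xs are the x-coordinates of the other contacts.
    Assumes x grows toward the user's right.
    """
    xs = [thumb_x] + list(finger_xs)
    center_x = sum(xs) / len(xs)
    if thumb_x < center_x:
        return "right"
    if thumb_x > center_x:
        return "left"
    return "unknown"
```

For example, `hand_of_cluster(10, [30, 40, 50, 60])` places the thumb left of the center and therefore reports a right hand.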

Ear and earlobe contacts can generate patches that are roughly the same size as those generated by fingers and thumbs. It has been found, however, that the creases, ridges, and generally rough topography of the ear produce proximity images distinct from those of fingers and thumbs, at least if the imaging sensor (i.e., touch-surface) covers a significant portion of the ear (i.e., not just the fleshy lobule). The irregularity measure described above is one way to characterize contact roughness (see EQ. 5). This provides a robust means to discriminate contacts due to ears and earlobes from contacts due to fingers, thumbs, cheeks, thighs and palms. It has been found that the defined irregularity measure tends to give values between 1.0 and 2.0 for ear and earlobe contacts while regular (e.g., smooth) contacts attributable to fingers, thumbs, palms and cheeks give values less than about 1.0. Referring to FIG. 7, illustrative empirical data is shown that illustrates the distinction between ear contacts 700 and other contacts 705 (e.g., fingertips, thumbs and cheeks) based on the above defined irregularity measure. In one embodiment, threshold 710 comprises a linear step-like or spline structure with a first level at an irregularity measure of between approximately 1.0 and 1.2 and a second level at between approximately 1.1 and 1.2. In another embodiment, a single linear function having a positive slope may be used. In yet another embodiment, higher level functions may be used to segregate the various contact types. As noted above, while the exact values for patch contacts may vary from those shown in FIG. 7, it is clear that most rough object contacts may be distinguished from most smooth or regular object contacts using the defined irregularity measure, where the exact nature or form of a decision threshold (e.g., threshold 710) is dependent upon the precise implementation, operational goals and capabilities of the target multi-touch device.

In one embodiment, successive proximity images (aka “frames”) are used to track objects as they move across a touch-surface. For example, as an object is moved across a touch-surface, its associated patch(es) may be correlated through overlap calculations. That is, patches identified in successive images that overlap in a specified number of pixels (or fraction of patch pixels) may be interpreted to be caused by the same object. In such embodiments, the maximum patch minor radius over the life of the tracked contact may be compared to the thresholds discussed above (e.g., thresholds 310 in FIG. 3, 410 in FIG. 4, 510 in FIG. 5, 605 in FIG. 6 and 710 in FIG. 7). This approach ensures that a palm contact, for example, does not lose its palm identity should its minor radius temporarily fall below the decision threshold (e.g., 410). It is further noted that if a decision threshold is not a constant value (e.g., 310) but rather some curve (e.g., 410 and 710), it may be advantageous to apply a density-correction to the instantaneous minor radius prior to the maximum minor radius accumulation operation described here.
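Frame-to-frame correlation by pixel overlap and accumulation of the maximum minor radius can be sketched as below. Representing a patch as a set of (row, column) pixel coordinates, the 0.5 overlap fraction, and the class and function names are all assumptions for illustration; the density correction mentioned above is omitted.

```python
def same_object(prev_pixels, curr_pixels, min_fraction=0.5):
    """Treat two patches from successive frames as the same object when
    they share at least min_fraction of the smaller patch's pixels.

    Patches are sets of (row, col) tuples; min_fraction is illustrative.
    """
    if not prev_pixels or not curr_pixels:
        return False
    shared = len(prev_pixels & curr_pixels)
    return shared / min(len(prev_pixels), len(curr_pixels)) >= min_fraction


class TrackedContact:
    """Remembers the largest minor radius seen over a contact's life."""

    def __init__(self):
        self.max_minor_radius_mm = 0.0

    def update(self, minor_radius_mm):
        self.max_minor_radius_mm = max(self.max_minor_radius_mm,
                                       minor_radius_mm)

    def is_palm(self, threshold_mm):
        # Compare the lifetime maximum, not the instantaneous value,
        # so a palm keeps its identity if its radius briefly shrinks.
        return self.max_minor_radius_mm > threshold_mm
```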

When taking a multi-touch device in and out of a pocket, or otherwise generally handling it, users should have the freedom to clasp their hand around it without producing spurious input. Such finger-clasps can be detected via any one of the following criteria:

Identification (via block 125 in FIG. 1) of five, six or more distinct surface contacts. (For a touch-surface the size of a deck of cards, this many fingertips won't normally fit on the surface, but since the phalange joints of each flattened finger may get segmented into more than one contact patch, two or three flattened fingers may generate five or more contact patches.)

Two, three or more contact patches are identified and the major radius of at least two exceeds approximately 15 millimeters to 18 millimeters. Since cheeks and other large body parts normally produce just one patch with large major radius, the requirement for two or three large patches prevents this test from triggering on a cheek, leg or chest. Also, the requirement for multiple large major radii prevents this test from triggering on a couple of fingertips accompanied by a long thumb laid flat against the surface.
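The two finger-clasp criteria above can be combined in a small predicate. This is a sketch under stated assumptions: the input is a list of patch major radii in millimeters, and the cutoffs (five contacts, 15 mm, two large patches) are taken from the low end of the ranges given in the text.

```python
def is_finger_clasp(major_radii_mm, min_contacts=5,
                    large_radius_mm=15.0, min_large=2):
    """Detect a hand clasped around the device.

    Criterion 1: five or more distinct contact patches.
    Criterion 2: at least two patches, with at least min_large of them
    having a major radius above large_radius_mm.
    All cutoff values are illustrative.
    """
    if len(major_radii_mm) >= min_contacts:
        return True
    large = sum(1 for r in major_radii_mm if r > large_radius_mm)
    return len(major_radii_mm) >= 2 and large >= min_large
```

A single large patch, as produced by a cheek, fails both criteria, while two flattened fingers with major radii of 17 mm and 19 mm satisfy the second.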

In another embodiment of the invention, multi-touch processing methodology may include far-field processing. As used herein, far-field processing refers to the detection and processing associated with bodies (e.g., fingers, palms, cheeks, ears, thighs, . . . ) that are close to (e.g., less than one millimeter to more than a centimeter) but not in contact with the touch-surface. The ability to detect far-field objects may be beneficial in touch-surface devices that, during normal use, are brought into close proximity to a user. One example of such a device is a telephone that includes a touch-surface for user input (e.g., dialing).

Referring to FIG. 8, in one embodiment initial far-field processing may be performed after proximity image data is acquired, that is, after operations in accordance with block 105 in FIG. 1. If the far-field measurement is designed to remain negative in the absence of any object near the touch-surface, and only become positive in the presence of a large object, a first step subtracts a small noise factor from the initially acquired proximity image to create a negative-background far-field image (block 800):

Negative Far-Field Image = [PROX] − (Noise Factor)  EQ. 10

In one embodiment, the noise factor may be set to between approximately 1 and 2 standard deviations of the average noise measured or expected over the entire image. This will cause most pixels in the resulting negative far-field image to be slightly negative rather than neutral in the absence of any touch-surface contact. As noted in FIG. 8, the negative far-field image resulting from operations in accordance with block 800 is denoted [NFAR].

Next, each pixel in the [NFAR] image is saturated to the highest level expected from an object hovering a few millimeters from the touch-surface (block 805). In one embodiment, the resulting far-field saturation image (denoted [SFAR] in FIG. 8) is generated as shown in Table 3.

TABLE 3 Illustrative Far-Field Saturation Operations

For each pixel in the initial far-field image:
    If [NFAR] > (Far-Field Saturation Limit)
        [SFAR] = (Far-Field Saturation Limit)
    Else
        [SFAR] = [NFAR]

Since one goal of far-field operations is to be sensitive to large numbers of pixels only slightly activated (e.g., having small positive values), without being overwhelmed by a few strongly active pixels (e.g., having large positive values), the saturation limit value should be less than the peak pixel value from fingers or thumbs hovering within approximately 1 to 2 millimeters of the touch-surface, but not so low as to cause the resulting [SFAR] image to lose too much information content. While the precise far-field saturation limit value will vary from implementation to implementation (due to differences in sensor element technology and associated circuitry), it has been determined empirically that a suitable value will generally lie between +3 standard deviations and +6 standard deviations of noise associated with the initial far-field image. (Again, this noise may be on a per-pixel, or whole image basis.)

If the initial proximity image [PROX] contains a significant amount of noise, it may be beneficial to filter the [SFAR] image (block 810). In one embodiment, a finite impulse response filter technique may be used wherein two or more consecutive [SFAR] images are averaged together. In another embodiment, an infinite impulse response filter technique may be used to generate a smoothed image. It will be recognized that an infinite impulse response filter generates a weighted running average (or auto-regressive) image. In one embodiment, for example, an infinite impulse response filter combines the current far-field saturated image (e.g., [SFAR]_(new)) with the immediately prior far-field saturated image (e.g., [SFAR]_(prior)) in a one-third to two-thirds ratio. As noted in FIG. 8, a filtered far-field saturated image generated in accordance with block 810 is denoted [FARF].
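The three preprocessing steps (EQ. 10, Table 3 and the running-average filter) can be sketched on images stored as lists of rows. The function names are hypothetical. The sketch assumes that the one-third weight applies to the new image and two-thirds to the prior one, and that the prior image fed back is the previous filter output, which is what makes the filter a running average.

```python
def negative_far_field(prox, noise_factor):
    """EQ. 10: subtract a small noise factor from every pixel -> [NFAR]."""
    return [[p - noise_factor for p in row] for row in prox]


def saturate(nfar, limit):
    """Table 3: clip every pixel at the far-field saturation limit -> [SFAR]."""
    return [[min(p, limit) for p in row] for row in nfar]


def iir_filter(sfar_new, farf_prior, new_weight=1.0 / 3.0):
    """Blend the current image with the prior one -> [FARF].

    new_weight = 1/3 is an assumed reading of the one-third to
    two-thirds ratio (one third new, two thirds prior).
    """
    return [[new_weight * n + (1.0 - new_weight) * p
             for n, p in zip(row_new, row_prior)]
            for row_new, row_prior in zip(sfar_new, farf_prior)]
```

A typical per-frame call chain would be `iir_filter(saturate(negative_far_field(prox, noise), limit), farf_prior)`.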

Following image segmentation operations in accordance with block 125 (see FIG. 1), a weighted average of non-linearly scaled background pixel values may be used to generate a scalar far-field indicator value (FAR-FIELD) in accordance with the invention as follows:

$\begin{matrix} {\text{FAR-FIELD} = \frac{\sum\limits_{\text{background } i,j} \text{ENeg}\left([\text{FARF}]_{[i][j]}\right) \times \left[\text{LOC}_{[i][j]}\right]}{\sum\limits_{\text{background } i,j} \left[\text{LOC}_{[i][j]}\right]}} & {\text{EQ. 11}} \end{matrix}$

where the ENeg( ) function non-linearly amplifies pixel values below a threshold (e.g., zero) and [LOC] represents a pixel weighting mechanism. As indicated in EQ. 11, only proximity image background pixels contribute to the computed FAR-FIELD value. That is, pixels identified as belonging to a patch during image segmentation operations are excluded during far-field measurement operations.

In one embodiment, the ENeg( ) function disproportionately emphasizes the contributions from background pixels as follows:

$\begin{matrix} {\text{ENeg}(\text{pixel value}) = \begin{cases} \text{pixel value} & \text{for } 0 \leq \text{pixel value} \leq B \\ 2 \times \text{pixel value} & \text{for } (-B \div 2) \leq \text{pixel value} < 0 \\ B + (3 \times \text{pixel value}) & \text{for } \text{pixel value} < (-B \div 2) \end{cases}} & {\text{EQ. 12}} \end{matrix}$

where B represents a far-field saturation limit value. Empirically determined, B is chosen to permit a small number of negative pixels to cancel out a finger or thumb-sized patch of positive pixels. In this way, only a nearly full coverage cheek-sized patch of positive pixels, plus a small remainder of neutral/background pixels, can produce a strongly positive far-field measurement.

While not necessary, disproportionately emphasizing the contributions from background pixels in accordance with EQs. 11 and 12 permits the FAR-FIELD measurement to be more selective for bodies large enough to positively affect most of a touch-surface's pixels (e.g., cheeks and legs), while not being overly influenced by medium-sized objects (e.g., hovering thumbs). For example, if a hovering thumb causes half of a touch-surface's sensor elements to have a slightly above-background pixel value, disproportionately emphasizing the half that remain below background will keep the measured FAR-FIELD value below zero indicating no large object is “near” the touch-surface (e.g., within 1 to 3 centimeters). In another embodiment, background pixels may be linearly combined (e.g., summed).
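EQs. 11 and 12 can be sketched together as follows. The function names, the boolean background mask, and the list-of-rows image layout are assumptions for illustration. The piecewise function is coded exactly as EQ. 12 is written; values above B are passed through unchanged on the assumption that saturation has already capped them.

```python
def eneg(value, b):
    """EQ. 12 as written: amplify negative (below-background) pixels.

    Note: coded literally, the last two branches do not meet at -b/2
    (2 * value gives -b there, b + 3 * value gives -b/2).
    """
    if value >= 0:
        return value            # inputs are expected to be <= b
    if value >= -b / 2.0:
        return 2.0 * value
    return b + 3.0 * value


def far_field(farf, loc, is_background, b):
    """EQ. 11: LOC-weighted average of ENeg over background pixels only."""
    numerator = 0.0
    denominator = 0.0
    for i, row in enumerate(farf):
        for j, value in enumerate(row):
            if is_background[i][j]:   # patch pixels are excluded
                numerator += eneg(value, b) * loc[i][j]
                denominator += loc[i][j]
    return numerator / denominator if denominator else 0.0
```

A result above the chosen far-field threshold (e.g., zero) would then indicate a large nearby object.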

As noted above, [LOC] represents a pixel weighting mechanism. In general, there is one value in [LOC] for each pixel present in the touch-surface. If it is desired to consider all touch-surface pixels equally, each value in the [LOC] image may be set to 1.0 (or some similar constant value). For hand-held form-factors, however, selectivity for large bodies may be improved by lowering the weights near the bottom and side edges (for example, to values between 0.1 and 0.9). Doing this can lessen false-positive contributions from a hand whose fingers wrap around the device during (clasping) operations. In mobile phone form-factors, to retain sensitivity to ear and cheek far-fields, the weights along the top edge (where thumbs and fingers are less likely to hover or wrap) may be kept at full strength.
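One way to build such a weight image is sketched below. The function name, the single-pixel border width, the 0.5 edge weight, and the choice to keep the entire top row (including its corners) at full strength are assumptions; row 0 is taken to be the top edge of the touch-surface.

```python
def make_loc(rows, cols, edge_weight=0.5, border=1):
    """Build a [LOC] weight image with reduced bottom and side weights.

    edge_weight should lie in the 0.1 to 0.9 range mentioned in the
    text; border is the assumed width, in pixels, of the reduced band.
    """
    loc = [[1.0] * cols for _ in range(rows)]
    for i in range(border, rows):          # skip the top band entirely
        for j in range(cols):
            near_bottom = i >= rows - border
            near_side = j < border or j >= cols - border
            if near_bottom or near_side:
                loc[i][j] = edge_weight
    return loc
```

Setting `edge_weight=1.0` reproduces the uniform weighting in which all pixels count equally.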

Returning now to FIG. 1 at block 140, when far-field measurements are taken into account during contact discrimination operations, a FAR-FIELD value greater than a specified threshold (e.g., zero) indicates a large “near by” object has been detected. As previously noted, this information may be used to transition the touch-surface device into a specified mode (e.g., on, off or low power). In addition, far-field measurements may be combined with other measurements (e.g., the irregularity measure) to provide improved ear detection. For example, when a touch-surface is partly against an ear and also hovering a centimeter or two from a cheek, a weak ear pixel patch may be segmented in accordance with block 125 at the top of the screen. Meanwhile, the middle and bottom of the touch-surface would only be affected by the cheek's far-field. Even if the FAR-FIELD measurement as taken outside the ear patch is not strong enough to exceed the specified far-field threshold on its own, the FAR-FIELD value can be added or otherwise combined with (weak) patch density and irregularity measure indicators such that the sum or combination surpasses an ear detection threshold.

In addition, one or more proximity sensors may be positioned above the touch-surface's top edge or around, for example, a telephone's receiver opening. Illustrative proximity sensors of this type include, but are not limited to, active infrared-reflectance sensors and capacitance-sensitive electrode strips. In a mobile telephone form-factor, when the device is held such that the receiver is centered on the ear canal, ear ridges may trigger the proximity sensor. Meanwhile the earlobe may cause a small pixel patch in the top portion of the touch-surface. Discrimination operations in accordance with block 140 could decide that when a pixel patch at the top of the touch-surface is accompanied by any significant receiver proximity trigger, the pixel patch must be an ear, not a finger. In another embodiment, the same conditions but with a significant FAR-FIELD value for the lower portion of the touch-surface (indicating a hovering cheek) may be used to trigger detection of an ear at the top of the touch-surface. Generally speaking, one or more of signal density (see EQs. 7 and 8), patch irregularity (see EQ. 5), FAR-FIELD measurement (see EQ. 11) and proximity sensor input may be combined (e.g., a weighted average) so that ear detection can trigger when multiple indicators are weakly active, or just one indicator is strongly active. Finally, it is noted that contact discrimination parameters such as a patch's centroid, minor axis radius, patch irregularity (EQ. 5), patch signal density (EQs. 7 and 8), far-field (EQ. 11) and proximity sensor input (if available) may be (low-pass) filtered to help counteract their often sporadic nature. This may be particularly beneficial if the filters employ adaptive time constants that rise quickly in response to rising input values, but decay more slowly when input values drop and/or are missing.
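The indicator combination and the fast-rise, slow-decay filtering described above can be sketched as follows. Everything specific here is an assumption: the indicator names, the scaling of each indicator to roughly 0 to 1, the use of a weighted sum with a detection threshold of 1.0, and the filter coefficients of 0.8 and 0.1.

```python
def asymmetric_lowpass(previous, sample, rise=0.8, decay=0.1):
    """One step of a low-pass filter with an adaptive time constant.

    The output follows rising inputs quickly and falling inputs slowly.
    A missing sample (None) is treated as zero and decays slowly.
    """
    target = 0.0 if sample is None else sample
    alpha = rise if target > previous else decay
    return previous + alpha * (target - previous)


def ear_detected(indicators, weights=None, threshold=1.0):
    """Combine indicators so several weak ones or one strong one trigger.

    indicators maps a hypothetical name (e.g. "density", "irregularity",
    "far_field", "proximity") to a value scaled to roughly 0..1.
    """
    weights = weights or {}
    score = sum(weights.get(name, 1.0) * value
                for name, value in indicators.items())
    return score >= threshold
```

With unit weights, four indicators at 0.3 each trigger detection, as does a single indicator at 1.0, while one indicator at 0.3 alone does not.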

In accordance with still another embodiment of the invention, it has been found beneficial to “squash” measured pixel values prior to determining some patch parameters. As used herein, “squashed” means to deemphasize pixel values in a nonlinear fashion. Illustrative squashing functions include the square root, the third root, one-over-the-value-squared, piece-wise linear functions (e.g., splines), etc. Referring to FIG. 9, each pixel in the [CNST′] image is squashed (block 900) prior to being used to generate patch covariance matrices whose Eigenvalues identify a patch's major and minor radii and whose Eigenvectors identify the patch's orientation (block 905). In addition, and as described above, the [CNST′] image is also used to determine patch centroid (block 910) and total signal (block 915) parameters. The parameters generated in this manner may be used in any of the discrimination methods described in connection with block 140 (see FIG. 1).

One benefit of using squashed pixel values in accordance with FIG. 9 is that patch shape and size measurements (e.g., radii) do not become dominated by fully covered central pixels as flesh touches and presses onto the touch-surface. The measurements remain more consistent whether a flesh object is hovering, lightly touching or fully pressed onto the surface. Another benefit of using squashed pixel values in accordance with FIG. 9 is that it tends to simplify the discrimination of certain patches over a range of patch signal densities. For example, when patch radius is computed based on squashed pixel values, palm contacts 1000 are easily distinguished from other contacts 1005 using a constant threshold function 1010. This is in contrast to linear threshold 410 illustrated in FIG. 4. Constant thresholds, in general, are easier and faster to implement than linear or non-linear thresholds. Another benefit of using squashed pixel values in accordance with FIG. 9 is that one may eliminate the density correction to minor radius values prior to thresholding operations as discussed above in connection with FIG. 4.
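The squashing step and the covariance-based radii can be sketched for a single patch. This is a minimal illustration under several assumptions: the square root is used as the squashing function, the centroid is taken from the unsquashed values, the covariance is weighted by the squashed values, and each radius is taken as the square root of the corresponding eigenvalue. The function names and the (x, y, value) pixel tuples are hypothetical, and the patch is assumed to have a positive total signal.

```python
import math


def squash(value):
    """Deemphasize large pixel values; square root is one listed option."""
    return math.sqrt(max(value, 0.0))


def patch_radii(pixels):
    """Return (major, minor) radii of a patch from its 2x2 covariance.

    pixels is a list of (x, y, value) tuples for one patch.
    """
    total = sum(v for _, _, v in pixels)
    cx = sum(x * v for x, _, v in pixels) / total   # unsquashed centroid
    cy = sum(y * v for _, y, v in pixels) / total
    weights = [squash(v) for _, _, v in pixels]
    wsum = sum(weights)
    sxx = sum(w * (x - cx) ** 2 for w, (x, _, _) in zip(weights, pixels)) / wsum
    syy = sum(w * (y - cy) ** 2 for w, (_, y, _) in zip(weights, pixels)) / wsum
    sxy = sum(w * (x - cx) * (y - cy)
              for w, (x, y, _) in zip(weights, pixels)) / wsum
    # Closed-form eigenvalues of the symmetric matrix [[sxx, sxy], [sxy, syy]].
    mean = (sxx + syy) / 2.0
    spread = math.hypot((sxx - syy) / 2.0, sxy)
    major = math.sqrt(mean + spread)
    minor = math.sqrt(max(mean - spread, 0.0))
    return major, minor
```

The ratio `major / minor` then gives the eccentricity of EQ. 9, and `minor` can be compared against a constant threshold.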

Referring to FIG. 11, a touch-surface device 1100 of the type described herein is shown in block diagram form. As used herein, a touch-surface device is any device that receives user input from a multi-touch capable touch-surface component (i.e., an input unit that provides user input in the form of a proximity image). Illustrative touch-surface devices include, but are not limited to, tablet computer systems, notebook computer systems, portable music and video display devices, personal digital assistants, mobile telephones and portable video and audio players.

As illustrated, touch-surface element 1105 includes sensor elements and necessary drive and signal acquisition and detection circuitry. Memory 1110 may be used to retain acquired proximity image information (e.g., [PROX] image data) and by processor 1115 for computed image information (e.g., patch characterization parameters). Processor 1115 represents a computational unit or programmable control device that is capable of using the information generated by touch-surface element 1105 to determine various metrics in accordance with FIGS. 1, 2, 8 and 9. In addition, external component 1120 represents an entity that uses the generated information. In the illustrative embodiment, external component 1120 may obtain information from processor 1115 or directly from memory 1110. For example, processor 1115 could maintain a data structure in memory 1110 to retain indication of, for example, large body contact status, large body far-field status, irregular object indication status, proximity sensor status (if utilized), flat finger clasp status and normal finger touch status. In one embodiment, each status may be indicated by a single Boolean value (i.e., a flag).
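The status record that processor 1115 might keep in memory 1110 can be sketched as a set of Boolean flags. The class name, field names and the consumer function are hypothetical; only the six status categories come from the text.

```python
from dataclasses import dataclass


@dataclass
class ContactStatus:
    """One Boolean flag per status category named in the description."""
    large_body_contact: bool = False
    large_body_far_field: bool = False
    irregular_object: bool = False
    proximity_sensor: bool = False
    flat_finger_clasp: bool = False
    normal_finger_touch: bool = False


def should_ignore_touch(status):
    """Hypothetical policy for an external component: suppress ordinary
    touch handling while any large-body, ear or clasp flag is set."""
    return (status.large_body_contact or status.large_body_far_field
            or status.irregular_object or status.flat_finger_clasp)
```

An external component such as a driver could read this record each frame instead of re-deriving the classification itself.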

Various changes in the materials, components, circuit elements, as well as in the details of the illustrated operational methods are possible without departing from the scope of the following claims. It will be recognized, for example, that not all steps identified in FIG. 1 need be performed while others may be combined and still others divided into more refined steps. By way of example, in one embodiment patch peripheral pixel noise is not suppressed (see block 130). In another embodiment, patch peripheral pixel noise suppression is employed but no patch irregularity measure is made (see block 120). In still another embodiment, both patch peripheral pixel noise suppression and patch irregularity measures are determined and used.

In yet another embodiment, proximity image pixel values are squashed while in another embodiment, squashed pixel values are not used. When squashed pixel values are used, they may be used to determine patch radius and orientation values only. They may also be used to determine other parameters such as patch density and total patch signal parameters.

For embodiments that do not employ peripheral patch pixel noise reduction techniques, patch parameterization operations in accordance with block 135 use the [CNST] image and not the [CNST′] image as discussed above (see Table 2). In addition, patch parameterization operations in accordance with block 135 do not need to rely on statistical ellipse fitting. They could instead sum patch perimeter pixels and compare the obtained value to all patch pixels or attempt polygonal fits. Further, calibration operations (see Tables 1 and 2) may be delayed until, or made part of, image segmentation operations (block 125). In addition, it may be beneficial for purposes of image segmentation to mark pixels that are at, or have been set to, the background level (e.g., during operations in accordance with block 110). It is also noted that because the criteria for identifying a finger clasp are orthogonal to large body contact detection (see discussion above in [0051]), flat finger clasps may be used as a distinct gesture commanding a particular operation like locking the screen, going to sleep, or terminating a telephone call. If peripheral patch pixel noise reduction techniques are not used, pixel squashing techniques in accordance with FIGS. 9 and 10 may still be employed using the [CNST] image.

With respect to illustrative touch-surface device 1100, touch-surface element 1105 may incorporate memory (e.g., 1110) and/or processor (e.g., 1115) functions. In addition, external component 1120 may represent a hardware element (e.g., a host processor) or a software element (e.g., a driver utility).

Finally, acts in accordance with FIGS. 1, 2, 8 and 9 may be performed by a programmable control device executing instructions organized into one or more program modules. A programmable control device may be a single computer processor, a special purpose processor (e.g., a digital signal processor, “DSP”), a plurality of processors coupled by a communications link or a custom designed state machine. Custom designed state machines may be embodied in a hardware device such as an integrated circuit including, but not limited to, application specific integrated circuits (“ASICs”) or field programmable gate arrays (“FPGAs”). Storage devices suitable for tangibly embodying program instructions include, but are not limited to: magnetic disks (fixed, floppy, and removable) and tape; optical media such as CD-ROMs and digital video disks (“DVDs”); and semiconductor memory devices such as Electrically Programmable Read-Only Memory (“EPROM”), Electrically Erasable Programmable Read-Only Memory (“EEPROM”), Programmable Gate Arrays and flash devices.

The invention claimed is:
 1. A method to identify an input to a touch-surface device, comprising: obtaining a proximity image from an array of sensing elements; segmenting the proximity image to identify one or more patches; generating a proximity sensor signal when an object is proximate a proximity sensor, the proximity sensor distinct from the array of sensing elements; determining a signal density value for at least one of the one or more patches; and using the signal density value and the proximity sensor signal to determine an object adjacent to or in contact with the touch-surface device; and wherein each patch has a plurality of pixels and each pixel has a pixel value and the signal density value is determined by dividing the sum of all pixel values in a patch by a radius of the patch.
 2. The method of claim 1, wherein the radius of the patch comprises a geometric mean radius of the patch.
 3. The method of claim 1, further comprising performing a control operation of the touch-surface device based on the determining of the object adjacent to or in contact with the touch-surface device.
 4. The method of claim 3, wherein the control operation comprises causing the touch-surface device to change an operational state.
 5. The method of claim 3, wherein the control operation comprises performing a specified function.
 6. The method of claim 3, wherein the control operation comprises ignoring the object contacting the touch-surface.
 7. The method of claim 1, wherein the touch-surface device comprises one of the devices selected from the group consisting of: a tablet computer system, a hand-held computer system, a portable music player system, and a portable video player system.
 8. The method of claim 1, wherein the touch-surface device comprises a mobile telephone.
 9. The method of claim 1, wherein using the signal density value and the proximity sensor signal for determining the object adjacent to or in contact with the touch-surface device comprises, in part, utilizing threshold values of the signal density to discriminate when an object is hovering over the touch-surface device or in contact with the touch-surface device.
 10. A program storage device, readable by a programmable control device, comprising instructions stored thereon for causing the programmable control device to perform the method of claim 1.
 11. A touch-surface device, comprising: a memory for storing a proximity image of a touch surface of the touch-surface device, the proximity image obtained from an array of sensing elements; a processor operative for: segmenting the proximity image to identify one or more patches; generating a proximity sensor signal when an object is proximate a proximity sensor, the proximity sensor distinct from the array of sensing elements; determining a signal density value for at least one of the one or more patches; and using the signal density value and the proximity sensor signal for determining an object adjacent to or in contact with the touch-surface device; and wherein each patch has a plurality of pixels and each pixel has a pixel value and the signal density value is determined by dividing the sum of all pixel values in a patch by a radius of the patch.
 12. A touch surface device as recited in claim 11 wherein the radius of the patch comprises a geometric mean radius of the patch.
 13. A touch surface device, comprising: a memory for storing a proximity image of a touch surface of the touch-surface device, the proximity image obtained from an array of sensing elements; a processor operative for: segmenting the proximity image to identify one or more patches; generating a proximity sensor signal when an object is proximate a proximity sensor, the proximity sensor distinct from the array of sensing elements; determining a signal density value for at least one of the one or more patches; and using the signal density value and the proximity sensor signal for determining an object adjacent to or in contact with the touch-surface device; and wherein each patch has a plurality of pixels and each pixel has a pixel value and the signal density value is determined by dividing a sum of all pixel values in a patch by the number of pixels in the patch.
 14. A method to identify an input to a touch-surface device, comprising: obtaining a proximity image from an array of sensing elements; segmenting the proximity image to identify one or more patches; generating a proximity sensor signal when an object is proximate a proximity sensor, the proximity sensor distinct from the array of sensing elements; determining a signal density value for at least one of the one or more patches; and using the signal density value and the proximity sensor signal to determine an object adjacent to or in contact with the touch-surface device; and wherein each patch has a plurality of pixels and each pixel has a pixel value and the signal density value is determined by dividing a sum of all pixel values in a patch by the number of pixels in the patch.
 15. A program storage device, readable by a programmable control device, comprising instructions stored thereon for causing the programmable control device to perform the method of claim 14.
 16. The method of claim 14, wherein using the signal density value and the proximity sensor signal for determining the object adjacent to or in contact with the touch-surface device comprises, in part, utilizing threshold values of the signal density to discriminate when an object is hovering over the touch-surface device or in contact with the touch-surface device.
 17. A method to identify an input to a touch-surface device, comprising: obtaining a proximity image from an array of sensing elements; segmenting the proximity image to identify one or more patches; generating a proximity sensor signal when an object is proximate a proximity sensor, the proximity sensor distinct from the array of sensing elements; determining a signal density value for at least one of the one or more patches; and using the signal density value and the proximity sensor signal to determine an object adjacent to or in contact with the touch-surface device; and wherein each patch has a plurality of pixels and each pixel has a pixel value and the signal density value is determined by dividing the sum of all pixel values in a patch by the area of the patch.
 18. A program storage device, readable by a programmable control device, comprising instructions stored thereon for causing the programmable control device to perform the method of claim 17.
 19. The method of claim 17, wherein using the signal density value and the proximity sensor signal for determining the object adjacent to or in contact with the touch-surface device comprises, in part, utilizing threshold values of the signal density to discriminate when an object is hovering over the touch-surface device or in contact with the touch-surface device.